Search CORE

4 research outputs found

GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding

Author: Bout Andrey
Nikolenko Sergey
Piontkovskaya Irina
Podolskiy Alexander
Yakovlev Konstantin
Publication venue
Publication date: 14/11/2023
Field of study

Grammatical error correction (GEC) is an important NLP task that is currently usually solved with autoregressive sequence-to-sequence models. However, approaches of this class are inherently slow due to one-by-one token generation, so non-autoregressive alternatives are needed. In this work, we propose a novel non-autoregressive approach to GEC that decouples the architecture into a permutation network that outputs a self-attention weight matrix that can be used in beam search to find the best permutation of input tokens (with auxiliary {ins} tokens) and a decoder network based on a step-unrolled denoising autoencoder that fills in specific tokens. This allows us to find the token permutation after only one forward pass of the permutation network, avoiding autoregressive constructions. We show that the resulting network improves over previously known non-autoregressive methods for GEC and reaches the level of autoregressive methods that do not use language-specific synthetic data generation methods. Our results are supported by a comprehensive experimental validation on the ConLL-2014 and Write&Improve+LOCNESS datasets and an extensive ablation study that supports our architectural and algorithmic choices.Comment: ACL 202

arXiv.org e-Print Archive

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Author: Artemova Ekaterina
Bout Andrey
Lipin Dmitry
Piontkovskaya Irina
Podolskiy Alexander
Publication venue
Publication date: 11/01/2021
Field of study

Real-life applications, heavily relying on machine learning, such as dialog systems, demand out-of-domain detection methods. Intent classification models should be equipped with a mechanism to distinguish seen intents from unseen ones so that the dialog agent is capable of rejecting the latter and avoiding undesired behavior. However, despite increasing attention paid to the task, the best practices for out-of-domain intent detection have not yet been fully established. This paper conducts a thorough comparison of out-of-domain intent detection methods. We prioritize the methods, not requiring access to out-of-domain data during training, gathering of which is extremely time- and labor-consuming due to lexical and stylistic variation of user utterances. We evaluate multiple contextual encoders and methods, proven to be efficient, on three standard datasets for intent classification, expanded with out-of-domain utterances. Our main findings show that fine-tuning Transformer-based encoders on in-domain data leads to superior results. Mahalanobis distance, together with utterance representations, derived from Transformer-based encoders, outperforms other methods by a wide margin and establishes new state-of-the-art results for all datasets. The broader analysis shows that the reason for success lies in the fact that the fine-tuned Transformer is capable of constructing homogeneous representations of in-domain utterances, revealing geometrical disparity to out of domain utterances. In turn, the Mahalanobis distance captures this disparity easily.Comment: to appear in AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Sinkhorn Transformations for Single-Query Postprocessing in Text-Video Retrieval

Author: Alimova Ilseyar
Bout Andrey
Nikolenko Sergey
Piontkovskaya Irina
Podolskiy Alexander
Polyakov Gregory
Yakovlev Konstantin
Publication venue
Publication date: 14/11/2023
Field of study

A recent trend in multimodal retrieval is related to postprocessing test set results via the dual-softmax loss (DSL). While this approach can bring significant improvements, it usually presumes that an entire matrix of test samples is available as DSL input. This work introduces a new postprocessing approach based on Sinkhorn transformations that outperforms DSL. Further, we propose a new postprocessing setting that does not require access to multiple test queries. We show that our approach can significantly improve the results of state of the art models such as CLIP4Clip, BLIP, X-CLIP, and DRL, thus achieving a new state-of-the-art on several standard text-video retrieval datasets both with access to the entire test set and in the single-query setting.Comment: SIGIR 202

arXiv.org e-Print Archive

Fluorinated counterion-enhanced emission of rhodamine aggregates: ultrabright nanoparticles for bioimaging and light-harvesting

Author: Adronov
Ahrens
Alivisatos
Andreas Reisch
Andrey S. Klymchenko
Battistelli
Berberan-Santos
Bronstein
Bwambok
Cametti
Castro
Chen
Colby
Dabbousi
de Rooy
Enseki
Farkas
Fery-Forgues
Genin
Genovese
Gladysz
Goncalves
Haugland
Hofkens
Hong
Hong
Horváth
Ievgen Shulov
Janjic
Jo
Jordan
Jordan
Kasha
Klymchenko
Koynov
Krossing
Larson
Lavis
Li
Li
Lin
Magde
Magut
Mayeul Collot
Michalet
Mottram
Ou
Pecher
Peng
Peng
Peng
Petkau
Qin
Rampazzo
Rao
Reisch
Schill
Schmidt
Shin
Sletten
Sule Oncul
Sun
Sun
Tang
Thomas Iii
Thompson
Trofymchuk
Vanden Bout
Vincent
Vu
Wagh
Wang
Ward
Woller
Wu
Wu
Würthner
Yao
Yao
Yao
Yao
Yeow
Youri Arntz
Yu
Yves Mely
Özhalıcı-Ünal
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/01/2015
Field of study

Crossref